Chinese dialect identification using an acoustic-phonotactic model
نویسندگان
چکیده
In this paper we develop hidden Markov model (HMM) based approaches to identify Chinese dialects spoken in Taiwan. This task can be aided by exploiting various characteristic features of Chinese spoken languages. The baseline system performs phonotactic analysis after the speech utterance is tokenized into a sequence of five broad phonetic classes. The sequential statistics of the resulting symbols are then used to distinguish one dialect from another. The second approach we tested is to incorporate dialect-dependent phonotactic constraints into the phonetic tokenization rather than applying these constraints after the broad phonetic classification is complete. These algorithms were evaluated using a multispeaker speech corpus of text-independent spontaneous speech data. Simulation results indicate that the acousticphonotactic approach to dialect identification yields better performance with an average identification rate of 89.6%, compared to 70% for the baseline system.
منابع مشابه
Multi-view Dimensionality Reduction for Dialect Identification of Arabic Broadcast Speech
In this work, we present a new Vector Space Model (VSM) of speech utterances for the task of spoken dialect identification. Generally, DID systems are built using two sets of features that are extracted from speech utterances; acoustic and phonetic. The acoustic and phonetic features are used to form vector representations of speech utterances in an attempt to encode information about the spoke...
متن کاملLanguage Recognition for Dialects and Closely Related Languages
This paper describes our development work to design a language recognition system that can discriminate closely related languages and dialects of the same language. The work was a joint effort by LIMSI and Vocapia Research in preparation for the NIST 2015 Language Recognition Evaluation (LRE). The language recognition system results from a fusion of four core classifiers: a phonotactic componen...
متن کاملChinese dialect identification using segmental and prosodic features.
Several approaches to Chinese dialect identification based on segmental and prosodic features of speech are described in this paper. When using segmental information only, the system performs phonotactic analysis after speech utterances have been tokenized into sequences of broad phonetic classes. The second scheme comprises prosodic models which are trained to capture tone sequence information...
متن کاملParallel Acoustic Model Adaptation for Improving Phonotactic Language Recognition
In phonotactic language recognition systems, the use of acoustic model adaptation prior to phone lattice decoding has been proposed to deal with the mismatch between training and test conditions. In this paper, a novel approach using diversified phonotactic features from parallel acoustic model adaptation is proposed. Specifically, the parallel model adaptation involves independent mean-only an...
متن کاملComparing different model configurations for language identification using a phonotactic approach
In this paper different model configurations for language identification using a phonotactic approach are explored. Identification experiments were carried out on the 11-language telephone speech corpus OGI-TS, containing calls in French, English, German, Spanish, Japanese, Korean, Mandarin, Tamil, Farsi, Hindi, and Vietnamese. Phone sequences output by one or multiple phone recognizers are res...
متن کامل